Explaining predictive models using Shapley values and non-parametric vine copulas
نویسندگان
چکیده
Abstract In this paper the goal is to explain predictions from complex machine learning models. One method that has become very popular during last few years Shapley values. The original development of values for prediction explanation relied on assumption features being described were independent. If in reality are dependent may lead incorrect explanations. Hence, there have recently been attempts appropriately modelling/estimating dependence between features. Although previously proposed methods clearly outperform traditional approach assuming independence, they their weaknesses. we propose two new approaches modelling Both based vine copulas, which flexible tools multivariate non-Gaussian distributions able characterise a wide range dependencies. performance evaluated simulated data sets and real set. experiments demonstrate copula give more accurate approximations true than competitors.
منابع مشابه
Truncation of vine copulas using fit indices
Vine copulas are flexible multivariate dependence models, which are built up from a set of bivariate copulas in different hierarchical levels. However, vine copulas have a computational complexity that is increasing quadratically in the number of variables. This complexity can be reduced by focusing on the sub-class of truncated vine copulas, which use only a limited number of hierarchical leve...
متن کاملSelection of Vine Copulas
Vine copula models have proven themselves as a very flexible class of multivariate copula models with regard to symmetry and tail dependence for pairs of variables. The full specification of a vine model requires the choice of vine tree structure, copula families for each pair copula term and their corresponding parameters. In this survey we discuss the different approaches, both frequentist as...
متن کاملPredictive Interval Models for Non-parametric Regression
Having a regression model, we are interested in finding two-sided intervals that are guaranteed to contain at least a desired proportion of the conditional distribution of the response variable given a specific combination of predictors. We name such intervals predictive intervals. This work presents a new method to find two-sided predictive intervals for non-parametric least squares regression...
متن کاملVine copulas for imputation of monotone non-response
Monotone patterns of non-response may occur in longitudinal studies. When the measured variables are dependent it is beneficial to use their joint statistical model to impute the missing values. We propose to use vine copulas to factorize the density of the observed variables into a cascade of bivariate copulas that yield a flexible model of their joint distribution. The structure of the vine d...
متن کاملTail dependence functions and vine copulas
Tail dependence and conditional tail dependence functions describe, respectively, the tail probabilities and conditional tail probabilities of a copula at various relative scales. The properties as well as the interplay of these two functions are established based upon their homogeneous structures. The extremal dependence of a copula, as described by its extreme value copulas, is shown to be co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Dependence Modeling
سال: 2021
ISSN: ['2300-2298']
DOI: https://doi.org/10.1515/demo-2021-0103